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Abstract 

The induction of the lac operon follows cooperative kinetics. The first mechanistic 
model of these kinetics is the de facto standard in the modeling literature (Yagil & 
Yagil, Biophys J, 11, 11-27, 1971). Yet, subsequent studies have shown that the model 
is based on incorrect assumptions. Specifically, the repressor is a tetramer with four 
(not two) inducer-binding sites, and the operon contains two auxiliary operators (in 
addition to the main operator). Furthermore, these structural features are crucial for 
the formation of DNA loops, the key determinants of lac repression and induction. 
Indeed, the repression is determined almost entirely (>95%) by the looped complexes 
(Oehler et al, EMBO J, 13, 3348, 1990), and the pronounced cooperativity of the 
induction curve hinges upon the existence of the looped complexes (Oehler et al, 
Nucleic Acids Res, 34, 606, 2006). Here, we formulate a model of lac induction taking 
due account of the tetrameric structure of the repressor and the existence of looped 
complexes. We show that: (1) The kinetics are significantly more cooperative than 
those predicted by the Yagil & Yagil model. The cooperativity is higher because the 
formation of looped complexes is easily abolished by repressor-inducer binding. (2) 
The model provides good fits to the repression data for cells containing tetrameric 
(or mutant dimeric) repressor, as well as the induction curves for 6 different strains 
of E. coli. It also implies that the ratios of certain looped and non-looped complexes 
are independent of inducer and repressor levels, a conclusion that can be rigorously 
tested by gel electrophoresis. (3) Repressor overexpression dramatically increases the 
cooperativity of the induction curve. This suggests that repressor overexpression can 
induce bistability in systems, such as growth of E. coli on lactose, that are otherwise 
monostable. 
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Figure 1. Kinetic scheme of the Yagil & Yagil model (lYagil and Yagi 3. ll97lh . Here, 
X denotes the inducer, R denotes the repressor, and O denotes the operator. 



1 Introduction 



Genetic switches plays a fundam ental role in development and evolution (ICarroll et al 
2005 : Ptashne and Gann . 20021 ). The development of embryos is now known to 
be orchestrated by an array of genetic switches. There is growing belief that 
the biodiversity of organisms reflects the evolution of the regulatory genes 
controlling these genetic switches. 



The lac operon is a paradigm of the mechanism by which genetic switches are 
regulated. Key mechanisms of gene regulation, such as negative and positive 
control by t he repressor and C AP, respectively, were revealed by studies of the 
lac operon ( jMuller-Hilli I1996T ). Not surprisingly, the lac operon has been, and 
continues to be, the system of choice for researchers interested in the dynamics 
of gene regulation (ILaurent et all 120051 ) . 



It has been known for many years t hat the lac induct ion rate is a sigmoidal 
function of the inducer concentration (lHerzenbergj . ll959l ). The first mechanistic 
model of these kinetics was based on the following assumptions (Fig. [T]): 

(1) The lac operon contains one operator. 

(2) The lac repressor has two inducer-binding sites. 

(3) Inducer-bound repressor (R ■ X, X ■ R ■ X) cannot bind to the operator. 

The first assumption was supported by the prevailing knowledge of the lac 
operon. There was no direct evidence for the last two assumptions — they were 
made because they yielded sigmoidal kinetics. Indeed, the above assumptions 
imply that the induction rate is proportional to the expression 



1 + K xl x + K xl K x2 x 2 
1 + K x r t + K xl x + K xl K x2 x 2 

where x is the inducer concentration; K x i,K x2 are the association constants 
for binding of the first and the second inducer molecules to the repressor; K± 
is the association constant for repressor-operator binding; and r t is the total 
concentration of the repressor. 

Yagil & Yagil also performed an extensive study of the extent to which their 
model captured the data. They showed that in some instances, the data could 
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Figure 2. The structure of the lac repressor (adapted from Muller-Hilll . 1996 . 
Chap. 3.4). The open circles represent free inducer-binding sites. The binding of 
an inducer to a dimer (closed circle) changes the relative orientation of the two sub- 
domains of the core, thus separating the headpieces and abolishing their ability to 
bind to an operator. 



be fitted by the simpler expression 



1 + K xl K x2 x 2 



1 + Km + K xl K x2 x 2 



(2) 



which does not contain the linear term, K x \x. In yet other cases, the data 
could not be fitted unless eq. (DQ) was used. Nev ertheless, eq. (El) has become 
the de facto standard in the modeling literature (IChung and Stephanopoulosl . 
19961 : lOzbudak etldl l2004h . 



Since the publication of Yagil & Yagil's paper, studies have shown that as- 
sumptions (l)-(3) of the Yagil & Yagil model are not consistent with the 
structure of the lac operator and repressor. Specifically, the lac operon con- 
tains not one, but three operators; the repressor contains not two, but four 
inducer-binding sites; and finally, inducer-bound repressor can bind to the op- 
erator. Furthermore, these structural features have a profound effect on the 
repression and induction of the lac operon. 



In vivo, the lac repressor is a tetrameric molecule (IBarry and Matthews! , Il999h . 
which can be viewed as a "dimer of dimers" (Fig. [2]). Each monomer con- 
tains a headpiece that can bind to the operator, a core containing an inducer- 
binding site, and an oligomerization domain that mediates the linking of the 
two dimers. If a repressor dimer is inducer-free, its headpieces can interact 
strongly with an operator. This interaction is reduced if the dimer is inducer- 
bound, because inducer binding changes the relative orientation of the two 
subdomains o f the core, th us increasing the distance between the headpieces 
of the dimer (ILewisl . 120051 . Fig. 17). Kinetic studies suggest that the binding 
of even o ne inducer molecul e to a dimer abolishes its ability to bind to an 
operator fjOehler et all 120061 ) . It is therefore clear that the repressor molecule 
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Figure 3. The arrangement of the lac operators (not drawn to scale). The main 
operator, Ox, lies within the lac promoter. The auxiliary operator, O2, lies within 
lacZ, the gene encoding /3-galactosidase, and the auxiliary operator, O3 is adjacent 
to the binding site for CAP. 

has 4 identical inducer binding sites, and inducer-bound repressor can bind to 
the operator, provided one of its dimers is inducer-free. 

It has also been found that in addition to the main operator, denoted Ox, 
there are two auxiliary operators, denoted O2 and O3 (Fig. [3]). The auxiliary 
operator, 2 , located 401 bp downstream of Ox, lies within lacZ, the gene 
encoding /3-galactosidase. The auxiliary operator, 3 , located 92 bp upstream 
of Ox, is adjacent to the CAP binding site. Given these locations, one expects 
the transcriptional repression to increase in the presence of the auxiliary op- 
erators. If the repressor binds to O2, it can hinder the transcription of the 
operon; if it binds to 3 , CAP cannot attach effectively to its cognate site. 
It turns out that the repression is indeed higher in the presence of the auxil- 
iary operators, but not because these operators have a strong affinity for the 
repressor. Instead, they increase the repression by a subtle interaction that 
stabilizes the binding of the repressor to Ox- 

This interaction was revealed by mea suring the repressio n in cells contain- 



ing various combinations of operators (jOehler et all Il990l ). The repression is 
defined as the ratio 

K = ^f^- (3) 

e lx=0 

where x is the concentration of a gratuitous inducer (IPTG in these experi- 
ments), and e is the specific /3-galactosidase activity measured during expo- 
nential growth of lacY~ cells on a mixture of IPTG and a carbon source that 
cannot induce lac transcription (glycerol in these experiments). It provides a 
measure of the transcriptional inhibition in the absence of the inducer: 1Z is 1 
if there is no inhibition, and becomes progressively higher with the strength 
of the inhibition. Oehler et al observed that (Tabled]): 

(1) In the absence of the auxiliary operators, the repression is only 18. How- 
ever, it increases dramatically if O2 or O3 are also present (~40- and 
~25-fold, respectively). 

(2) In the presence of only O2 or O3, the repression is similar that observed in 
cells lacking all three operators. Thus, 2 and 3 have almost no affinity 
for the repressor. 

It follows that the increased repression observed in the presence of 0\ and 
O2 (or O3) does not occur simply because the auxiliary operators have 
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Table 1 

Repr ession observed in the presence of various combinations of the opera- 
tors dOehler et all fl~99ol. Fig. 2). 



Combination of operators 


Repression 


Combination of operators 


Repression 


Oi 


18 




1 


Oi, o 2 


700 


o 2 , o 3 


1.9 


Oi, o 3 


440 


Oi, o 2 , o 3 


1300 


2 


1 


No operators 


1 



a strong affinity for the repressor — instead, there is some interaction 
between the operators. 

(3) The repression in the presence of 2 and O3 is also similar to basal levels. 
It follows that the interaction primarily involves the pairs, Oi,0 2 and 
Oi,Oz — interactions between 2 and O3 make almost no contribution 
to the repression. 

(4) In the presence of all three operators, the repression is only 2- or 3-fold 
higher than that observed in the presence of the pairs, Oi, 2 and Oi, O3. 
Thus, the presence of either one of these two pairs is sufficient for the bulk 
of the repression. 

Oehler et al argued that the interaction between the operators reflects the 
formation of DNA loops. 

DNA loops can form only if the repressor is completely free of inducer. In 
this case, the binding of one of the repressor dimers to an operator brings 
the other (free) dimer close to the remaining the remaining two operators. If 
one of these operators is free, the free dimer can bind to it, thus forcing the 
intervening DNA to form a loop (Fig. Hj). 



Given the above mechanism for DNA loop formation, Oehler et al explained 
their data as follows. The repressor binds primarily to 0\. The 0\ ■ R complex 
thus formed is rapidly converted to a stable DNA loop by interaction with 
2 or O3. The conversion to a loop is rapid because it is driven by the "local 
concentration" of 0\ ■ R within small spheres having radii equal to the inter- 



operator distances of 401 and 92 bp (jOehler et all Il994l . Fig. 7). The loop 



is stable because even if thermal fluctuations cause the repressor to detach 
from, say, Oi, weak interaction of the repressor with 2 or O3 keeps it within 
a smal l neighborhood of Oi , thu s increasing the probability that it rebinds 
to 0\ (IPtashne and Gannl . I2002I . p. 20). In other words, the local concentra- 
tion effect increases the "on" rate for loop formation, and the rebinding effect 
decreases the "off" rate for loop formation. The net result is a high associa- 
tion constant for loop formation, a fact that is confirmed by the parameter 
estimates (Section [3|) . 
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Figure 4. The formation of a DNA loop ( from iMuller-Hilll . 119961 . Chap. 3.4). 

The above explanation assumes that (a) despite the low affinity, the repressor 
does bind to the auxiliary operators, and (b) the stability of the loop rests 
upon the proximity of the main and auxil iary operators. Bot h assumptions 
were confirmed in subsequent experiments ( lOehler et all Il994l ) . It was shown 
that the repressor binds weakly to the auxiliary operators, and there is no 
repression if the auxiliary operators are moved far away (>3600 base pairs) 
from 0\. 



Vilar & Leibler formulated a s tatistical thermodynamic model to account for 
the foregoing repression data ( IVilar and Leiblerl . 120031 1 . The model assumes 
that there is one main and one auxiliary operator, and transcription occurs if 
and only if the main operator is free. Given these assumptions, they showed 
that the repression is given by the expression 

jy e -AG m _|_ jy e -AG m -AGa-AG ; _|_ _/y(_/y _ ]^g— AG m -AG a 
U=1+ 1 + Ne -*G a > ( 4 ) 

where N is the number of repressor molecules per cell; AG m , AG a are the 
free energy changes (normalized by RT) due to binding of the repressor to 
the main and auxiliary operator, respectively; and AGi is the free energy 
change of loop formation. Equation j4]) captures the repression of pairs of 
operators for suitable values of N, AG m , AG a and AG/. Furthermore, the 
term, jVe _AGm_AGa_AGi , explains why DNA loops are so stable despite the 
weak repressor-operator binding. If the magnitude of the looping free energy, 
| AG 1 1, is sufficiently large, it can overcome the effect of small |AG m | , |AG a |. 

The above discussion shows that DNA looping strongly influences the mag- 
nitude of the repression (observed in the absence of the inducer). However, 
insofar as the formulation of dynamic models is concerned, it is of more inter- 
est to ask if DNA looping influences the kinetics of induction (observed in the 
presence of the inducer). It turns out that this is indeed the case. Recently, 
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increases the cooperativity of the induction 
(a,b) The induction curves for cells containing 
(a) no auxiliary operators, and (b) mutant dimeric repressor. The data was fitted 
with eq. (f28l) and the parameter values estimated by Oehler et al. (c) The induction 
curve for cells containing all three operators and tetrameric repressor. The data 
was fitted with eq. ([45]) and the parameter values in Table [2J 



Oehler et al c ompared the induct ion kinetics in the absence and presence of 



DNA looping (lOehler et all 120061 ). They abolished DNA looping by deleting 



the DNA encoding the auxiliary operators, or mutating the DNA encoding 
the oligomerization domain of the repressor (this results in the production of 
mutant dimers that cannot form the tetrameric structure necessary for DNA 
looping). In both cases, the induction kinetics were hyperbolic at all but the 
smallest inducer concentrations (Figs. [5h,b). In sharp contrast, the kinetics 
were strongly sigmoidal in the presence of DNA looping (Fig. [5k). The au- 
thors concluded that the "sigmoidality of the induction curve of the wt lac 
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system reflects cooperative repression through DNA loop formation." 

These experiments show that DNA looping massively amplifies the coopera- 
tivity of the induction kinetics. The goal of this work is to understand this 
phenomenon quantitatively. It is clear that we cannot appeal to the Yagil & 
Yagil model, since it does not account for the auxiliary operators and the at- 
tendant DNA looping. Here, we formulate a model of lac induction taking due 
account of both features. We find that 

(1) In the absence of DNA looping, the kinetics are formally similar to eq. (PQ) , 
the general form the Yagil & Yagil model. However, in the presence of 
DNA looping, the kinetics are significantly more cooperative. 

(2) In wild-type cells, they depend on powers of x as high as x 4 . The cooper- 
ativity increases markedly because looped repressor-operator complexes 
are very sensitive to the inducer concentrations. 

(3) If the repressor is overexpressed in wild-type cells, the kinetics become 
even more cooperative — they depend on powers of x up to x 6 . Under 
these conditions, multiple repressors are bound to the operons. These 
multi-repressor operons are even more sensitive to inducer concentrations 
than operons with one repressor typically found in wild-type cells. 

(4) The model provides good fits to the induction curves for 6 different strains 
of E. coli. More importantly, however, the model implies the existence 
of specific scaling relations between looped and non-looped complexes. 
These relations, which can be tested by gel electrophoresis, provide a 
more stringent test of the model. 



2 The model 

We begin by enumerating all possible states of the lac operon. We then de- 
fine the transcription rate in terms of the concentrations of the particular 
states that allow transcription. Finally, we derive the governing equations 
that determine the concentrations of these states as a function of the inducer 
concentration. 



2. 1 States of the lac operon 

We denote the free repressor (i.e., repressor not bound to an inducer or oper- 
ator) and its concentration by R and r, respectively. Since the free repressor 
has 4 inducer binding sites, there are 15 possible repressor-inducer complexes 
(Fig. [6]). We denote the concentrations of repressor-inducer complexes con- 
taining 1, 2, 3, and 4 inducer molecules by r\, r 2 , r 3 , and r 4 , respectively. 
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Figure 6. All possible states of repressor-inducer complexes. Here, R and X denote 
the repressor tetramer and inducer, respectively. The free repressor is on the left. 
Repressor-inducer complexes containing one free dimer are shown in red. 

We assume that a repressor dimer can bind to an operator if and only if it 
contains no inducer. It follows that: 



(1) In addition to the free repressor, there are six repressor-inducer com- 
plexes that can bind to the operator (shown in red in Fig. [6]). We denote 
any inducer-bound repressor with one free dimer by R! , and the total 
concentration of such complexes by r' . 

(2) Although both R and R' can bind to an operator, only operator-bound 
R can form DNA loops (Fig. BJ. Operator-bound R' lacks the free dimer 
necessary for forming a DNA loop (Fig. [2j). 

These two facts will be crucial for explaining the influence of DNA looping on 
the induction kinetics. 



The lac operon can be in numerous states. There are 14 possible states if 
we assume that only R can bind to an operator (Fig. [3). Several additional 
states are feasible because R' can also bind to an operator. To enumerate these 
states systematically, it is convenient to classify them based on the number of 
repressors bound to an operon. We shall refer to operons containing 0, 1, 2, 
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Free operon Unary operons Binary operons Ternary operon 




Figure 7. All possible states of the lac operon when only free repressor is permitted 
to bind to the operators. The black arrows show the reactions which a repressor 
binds to operator Oj, and K. L denotes the corresponding association constant. The 
red arrows show the reactions in which a repressor-bound operator, Oi — R, forms 
a loop by binding to a free operator Oy, Ky and Kij denote the corresponding 
association constants for unary and binary operons, respectively. 

and 3 repressors as free, unary, binary, and ternary operons, respectively. 

The operon can be free in only 1 way. We denote the concentration of free 
operons by o. 

Unary operons can exist in 9 different states. Six of these correspond to states 
in which either R or R' is bound to one of the operators, say, Oj. We denote 
the concentrations of these states by Oj and Oi> , respectively. The remaining 
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three states are obtained because free repressor bound to 0« can interact with 
another operator, Oj, to form a DNA loop. We denote the concentration of 
such looped states by o<j.. For example, denotes the concentration of the 
looped state obtained when a free repressor bound to 0\ interacts with 2 , or 
a free repressor bound to O2 interacts with 0\. These definitions imply that 

U — Oi + 2 + 3 + 0\i + 2 ' + Oy + 0£ + Ofe + 0£j, (5) 

where u denotes the total concentration of the unary operons. 

Binary operons can exist in 18 different states. Twelve of these correspond 
to the states obtained when R or R' bind to any two of the 3 operators. We 
denote the concentrations of such states by o^ , cvj, o^v, ctyy, where the indices 
i,j represent the two operators to which R or R' are bound, and the symbol ' 
above an index indicates that R', rather than R, is bound to the correspond- 
ing operator. The remaining 6 complexes are the looped states obtained when 
the free dimer of an operator-bound free repressor interacts with another free 
operator. We denote the concentration of such looped complexes by overlay- 
ing the symbol^on the subscripts representing the two interacting operators. 
For example, o 3 ^ 2 and o 3 ,^ denote the concentrations of the states in which 
operator 3 is bound to R and R' respectively, and operators 1, 2 interact by 
looping. It follows that 

b = (0 3 i + 3 'i + O31/ + O3/1/) + (O12 + V2 + X2 ' + Oil 2 ') 
+ (032 + 03'2 + 032' + O3/2O 

+ (°312 + °3l2' + °3f2 + °3'f2 + 3 T2 + °3^) » ( 6 ) 

where b denotes the total concentration of the binary operons. 

Ternary operons can exist in 9 possible states, none of which are looped be- 
cause loops cannot form in ternary operons. The concentrations of these states 
are denoted by o..., where each • contains an integer of the form % or %' indicating 
whether R or R! is bound to the i-th operator. Evidently 

t = (0312 + 3 i'2 + O312' + 3 i' 2 ') + (03/12 + 3 /i/ 2 + 3 'i 2 ' + Oyi'2') , (7) 

where t denotes the total concentration of the ternary complexes. 



2.2 Transcription rate 



Oehler et al have postulated that: 



(1) Binding of t he repressor to ^ blocks transcription by occluding RNA 
polymerase (jMiiller-Hilli Il99fil Chap. 1.18). 
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(2) Binding of the repressor to O2 has no effect on the transcription rate. 
This is not because the repressor rarely binds to O2: Even if the repressor 
is ov erexpressed 90-fold, 2 -containing cells show no measurable repres- 



sion ( jOehler et al.Ul990l Table I). This suggests that CVbound repressor 
cannot obstruct the movement of RNA polymerase. 
(3) Binding of the repressor to O3 does not block transcription. It merely re- 
duces (deactivates) the transcription rate by preventing CAP from bind- 
ing to the repressor. 

This hypothesis is based on the following argument. If repressor-bound 
3 blocked transcription, the repression in 3 -containing cells would in- 
crease monotonically with the repressor level. However, if t he repressor is 



overe xpressed in these cells, the repression saturates at 25 (lOehler et al 



1994 p. 3351). 



These postulates imply that the transcription rate is proportional to 



m O ( Oo Oo'\ , / O3 Oy O30 Oyo Oy>' Oyoi 0$> 
T =—+( — + — ) + dl — + — + — + — + — + — + 

O t \O t O t J \ O t O t O t O t O t O t O t 



where d < 1 is a parameter accounting for deactivation of transcription by 
repressor-bound O3. 



2.3 Governing equations 



To determine the concentrations of the various states, we assume that 

(1) The total concentrations of the repressor and operator, denoted r t and 
o t , are constant. 

(2) The system is in thermodynamic equilibrium, and satisfies the principle 
of detailed balance (i.e., the net rate of every reaction is zero). 

(3) The binding of R or R! to an operator does not affect the affinity of the 
remaining free operators for R and R'. Hence, one can define Ki and Ki> 
as the association constants for the binding of R and R' to O i7 regardless 
of the state of the remaining operators. Evidently, = Ki/2, since R 
contains two inducer-free dimers, both of which can bind to Oj, whereas 
R' contains only 1 inducer-free dimer. 

We denote the association constants for formation of unary and binary 
loops by Kij and K^, respectively (Fig. [7]). 

(4) All four inducer-binding sites on the repressor are identical and indepen- 
dent. We denote the association constant for binding of an inducer to any 
one of these sites by K x . 
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Assumption 1 implies the conservation relations 

(r + ?~i + r 2 + r 3 + r 4 ) + u + 2b + 3t = r t , (8) 

o + w + 6 + t = o t , (9) 

where the factors 2 and 3 in (jHj) account for the fact that the binary and 
ternary operons contain 2 and 3 repressors, respectively. 

Assumptions 2 and 3 yield the equilibrium relations 

Oi = Kior o v = \KiOr' o~. = K^K^or = K^KjOr, 

Oij = KiKjor 2 Oi>j = Oiji = \KiKjOrr' o V y = \KtKjO (r'f , 

and 



o 3l2 = K 2 K 3 K 3l or 2 = K 2 K x K X3 ar 2 o^, = \K 2 K 3 K 3l orr' = \K 2 K x K 13 orr' 



312' 



Og f2 = K 3 K x K X2 or 2 = K 3 K 2 K 21 or 2 = \K 3 K x K X2 orr' = \K 3 K 2 K 2l orr\ 



312 



3'12 



= K x K 3 K 32 or 2 = K x K 2 K 23 or 2 = \K x K 3 K 32 or 2 = \K x K 2 K 23 or 2 , 

0312 = K x K 2 K 3 or 3 
oyv2' = \K x K 2 K 3 o (r'f 



o 3 'i2 = o 3 i'2 = 0312' = \K x K 2 K 3 or 2 r\ 

3 1'2' = 3 / 12 = O3/1/2 = \K x K 2 K 3 0r {r'f 



where the concentrations of the looped species have two representations (e.g., 
0^ = KijKior = KjiKjor) because these species can be formed by two differ- 
ent pathways (Fig. [7]) If] 



These equilibrium relations imply that eqs. (EH71) can be rewritten as 



u = o 



(K l + K 2 + K 3 )ir + -\ + (K X K 12 + K,K 13 + K 2 K 23 ) r 



(K X K 2 + K X K 3 + K 2 K 3 ) r + - 



+ K x (k 2 K 23 + K 2 K 13 + K 3 K 12 ) [r + 



(10) 



:id 



t = o- K X K 2 K 3 r + 



12) 



2 Since the system is at equilibrium, thermodynamics demands that the free energy 
changes of the two pathways be the same (K^Kij = KjKji in the above example). 
The principle of detailed balance ensures that these thermodynamic constraints are 
satisfied (?). 
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Assumptions 2 and 4 imply that the total concentration of all the complexes 
shown in Fig. [6] is given by 



r + ri + r 2 + r 3 + r 4 = r (1 + 



(13) 



and the total concentration of repressor-inducer complexes with one free dimer 
is 



(AK x x + 2K 



2 2 



r + — = r (1 + i^x) 



(14) 



The se two equations fol low immediately from statistical thermodynamic the- 
ory ((Ackers et all Il982h . 



Substituting (fT0HT4l) in 



yields the two governing equations 



r{l + K x x) 

+or [(K x + K 2 + K 3 ) (1 + K x xf + K X K X2 + K X K X3 + K 2 K 23{ 
+2or 2 [(K X K 2 + K X K 3 + K 2 K 3 ) (1 + K x xf 
+K X (R 23 + K 13 + K 12 ) (1 + K x x) 2 ] + 3or 3 (1 + K.x) 6 K X K 2 K 3 = r t , (15) 



o + or 



(K x + K 2 + K 3 ) (1 + K x xf + iriKi 2 + K X K X3 + if 2 K 23 
+or 2 \{K X K 2 + ^1^3 + #2# 3 ) (1 + M 



+^1 (^ 23 + K13 + tf 12 ) (1 + i^x) 

containing the 3 variables, r, o, x. 
The equilibrium relations imply that 

T 



+ 3or 3 K x K 2 K 3 (l + K x x) 6 = o t , (16) 





Of 



1 + if 2 r (1 + fQx) 2 + d [K 3 r (1 + K x xf 
+K 2 K 3 r 2 (1 + i^x) 4 + K 2 K 23 r) . 



Eqs. (rT5l ) - (fT6l) yield o and r as a function of x, which can be substituted in 
the above expression to obtain T as a function of the inducer concentration. 



2.4 Scaled equations 



It is convenient to define the dimensionless variables 

- r - -is 
P = — ,v = — ,X = K xX, 

n o t 
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and the dimensionless parameters 



Ki = Kir t , i = 1,2,3, 

Oil = Kx + K 2 + K3, 

51 = K X K l2 + «iiCi 3 + ^2-^23, 

a 2 = kik 2 + kik 3 + /c 2 ac 3 , 

5 2 = «1 (/«2-^23 + ^2-^13 + ^3-^12) 
«3 = «1«2«3, 



Of 



The transcription rate is then proportional to 



T 



v 



K 2 p(l + X ) 2 + d{n 3 p(l + X ) 2 } 

+ K 2 K 3 p 2 (l + X) 4 + «2-^23p}] 



and eqs. (fToj)-(fTol) become 

p (1 + X f + lou [pfM + 2p 2 f 2 ( X ) + 3p 3 / 3 (x) 
u[l + pf 1 (x)+p 2 f2{x)+p 3 fs{x) 

where 



(17) 



(18) 
(19) 



A(x) = «i (1 + x) 2 + Si, 

/ 2 (X) = « 2 (1 + X) 4 + « 2 (1 + X) 2 , 

fs(x) = « 3 (1 + X) 6 • 

As we show below, the parameters, aij and S i? are related to the repression due 
to repressor-operator binding and DNA looping, respectively. The parameter, 
10, is typically quite small. In wild-type Escherichia coli, uo w 0.2 since each 



cell c ontains 10 repressor molecules and no more than 2 operators (jMuller-Hill , 
19961 . Chap. 3.2). In many experiments, the repressor is overexpressed (>50 
molecules per cell), so that to < 0.02. 



3 Results 



In what follows, we shall determine the values of a« and Oj by appealing to 
the repression data. It is therefore useful to express the repression in terms of 
the model. 

To this end, we begin by observing that during exponential growth in the 
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presence of IPTG and glycerol, the mass balance for /3-galactosidase yields 



de , , ( , \ n r e (x) 
= r e {x) - (r g + k e )e = 0- 



dt cv 7 V y • -) r g + k- J 

where x is the concentration of IPTG, r e (x) is the corresponding specific rate of 
/3-galactosidase synthesis, r g is the maximum specific growth rate on glycerol, 
and k~ is the rate constant for /3-galactosidase degradation. Since r g + k~ is 
a fixed parameter, e is proportional to r e , and j3]) becomes 



r e (0) 

It follows from ffT7l) that 



7? — — _ — = (20) 

T(0) v(0)[l + K 2 + d(K 3 + K 2 K 3 + K 2 K 23 )Y K ' 

where we have assumed that p(0) = 1, and at large inducer concentrations, 
p = 0,1/ = 1. 

Oehler et al measured the repression in the presence of various combinations 
of operators (Table [T]) . We shall distinguish these cases by using subscripts to 
denote the particular combination of operators being considered. Specifically, 
IZi will denote the repression in cells containing only the i-th operator, IZij 
will denote the repression in cells containing the z-th and j-th operators, and 
7?.3i 2 will denote the repression in cells containing all 3 operators. 

We begin by considering the special cases in which there is no DNA looping, 
and then proceed to the more general case that accounts for DNA looping. 



3.1 No DNA looping 



In the experiments, DNA looping was abolished by deleting the auxiliary op- 
erators or mutating the locus for the oligomerization domain of the repressor. 
Here, we consider the first case. The case of mutant dimers is discussed in 
Appendix [A] 



In the absence of the auxiliary operators, n 2 = k 3 = 0, so that 

«i = Kx, Si = a 2 = Si = «3 = 0, 
and eqs. (fT7l)-(fT9l) become T = u, and 



P (1 + X) 4 + ujupnx (1 + x) 2 = 1, 

\ + pKl {i + x) 2 } = 1. 



(21) 

(22) 
(23) 
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We can get v(x) from these equations by eliminating p and solving the result- 
ing quadratic. However, this solution is cumbersome and offers little insight. 
Instead, since uj is small, we appeal to perturbation theory (Appendix iBl), 
which formalizes the following physical argument. 

Since the number of operons per cell is small compared to the number of 
repressors per cell, one can assume, as a first approximation, that the fraction 
of operon-bound repressors is negligibly small compared to the fraction of 
free repressors, i.e., uj = 0. Equations (I22|) - (l23l) then yield the approximate 
zeroth-order solution 



1 



Po 



V 



(i+x) 



4 ! 



1 + Kipo (1 + X) 



2 ' 



(24) 
(25) 



To estimate the error of the approximation, we acknowledge that the fraction 
of operon-bound repressors is small but not zero. We assume furthermore that 



this fraction can be estimated by the expression, ujis p Ki (1 + xY 
the resulting equations 



and solve 



P (1 + xY + uv Q p Ki (i + xY 
l + p Kl (i + xf 



1. 



to obtain the improved first-order solution 



P 
v 



Mi 



?y 



u )}+O(uj 2 ) 



(26) 
(27) 



It follows from ( 1271 ) that the relative error of vq is approximately 



v-vo UJ (1 - u Y 



V 



1 + u (1 - Uq) 



< 



UJ 



l + uo 



Since, uj < 0.2, the zeroth-order solution is accurate to within 100uj/(1 + uj) ~ 
15% in wild-type cells, and even more accurate in repressor-overexpressed cells. 
Henceforth, we shall assume that eqs. (I24l) -(l25l) are a good approximation to 
the exact solution, so that 



T{x) = v(x) 



2 ! 



(28) 



which is formally identical to eq. (Dp) with K x l = 2K X , K x l K x 2 = the 
special case of the Yagil & Ya gil model correspondin g to identical and inde- 
pendent inducer-binding sites (lYagil and Yagill . Il97ll . p 19). 
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Figur e 8. Estimati on of ki and a c.s by fitting the repression data from lOehler et al.1 
1990, Table I, and behler et all 11994 . Fig. 1, to eqs. 



It follows from (1281 ) that induction is cooperative even in the absence of DNA 
looping. Indeed, since T(x) has a unique inflection point at \ - 



'ax/3 - 1, 

T = 1/4, the kinetics are cooperative for all inducer concentrations such that 
< T < 1/4. In the particular case of Fig. [5h, the kinetics are cooperative for 
all inducer concentrations in the rang e 0-50 uM, which is significantly higher 
than the 0-5 /xM range reported in lOehler et all I2006L based upon visual 
inspection of the curve. 

The parameters, Kx, K x x, can be estimated from the induction curve by ob- 
serving that (l28l) implies 



T 



+ 



-x. 



If the model is correct, a plot of \JT /{\ — T) vs x will be a straight line, and 
Ki, K x can be estimated from the slope and ^-intercept. The induction curve 
shown in Fig . J5h y ields a straight line with K\ = 227 and K~ l = 6.7 /iM 
flOehler et all 120061 . Fig. 4B). 



The value of Kx can also be estimated from the repression data. Indeed, it 
follows from §M) that 

w i = r7o)= 1 + K i- ( 29 ) 

Fitting the repression at various overexpression levels to this equation yields 
K\ = 30 for wild-type cells (Fig. Eh)- This is ~7-fold lower than the value 
estimated above because the induction curve was obtained with repressor- 
overexpressed cells. 

Although eq. (1251) was derived for cells containing the main operator Ox, anal- 
ogous expressions are obtained for cells containing an auxiliary operator, i.e., 



v = l/[l + ^/(l + x) 2 ] for 



2, 3. Equation (1201 ) then implies that the 
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repression in 03-containing cells is 



iz, 1 + K3 



1 + dnz ' 

which captures the deactivation effect noted by Oehler et al: increases with 
the repressor level until it saturates 1/d. However, the data shows no evidence 
of this saturation even if the repressor is overexpressed 90-fold (Fig.Eb). Non- 
linear regression of the data using the above expression yields the best-fit 
parameters, d = and k 3 = 0.24 for wild-type cells. It is conceivable that d 
is positive, but so small that dn 3 <C 1 for the overexpression levels shown in 
Fig. 03. Henceforth, we shall assume that d = 0, and 

^3 = 1 + « 3 , (30) 

a relation that is valid up to an overexpression level of 900 

Unlike K\ and k$, the parameter, K2, cannot be calculated from the repres- 
sion data for CVcontaining cells because they show no repression even if the 
repressor is overexpressed 90-fold. This property is implicit in the model as 
well. Indeed, (l20l) implies that 

TZ - 1 - l + K2 -i 

2 ~ Z/(0)(1 + K 2 ) ~ 1 + K 2 ~ ' 

regardless of the repressor level. Evidently, this reflects the fact that 2 -bound 
repressor does not block RNA polymerase. 



3.2 DNA looping 



In this case, the full system of eqs. fTl8l) - (PT9l) must be solved for v and p. 
Perturbation theory yields the zeroth-order solution 



1 



Po = —. — 74, (31) 



Uo 1 + pohix) + pIMx) + plhix)' (32) 

It is evident from (1321 ) that pofi(x)i Poh{x)j an d Pofoix) are the concen- 
trations of the unary, binary, and ternary operons, respectively, relative to 
the concentration of the free operons. We shall constantly appeal to this fact 
below. 



3 These estimates of Ki also provide good fits to the repression data for dimers 
(Fig. EjJ 
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The zeroth-order solution is a good approximation to the exact solution. In- 
deed, the first order solution is given by (Appendix IB1 



where 



P = p (i-Lun ) + o(u 2 ), 

u = u (l + wS%) +0(uj 2 ) 



°" l + Pofi{x)+f%h(x) + MMxY [ } 



and the relative error of u is approximately 

v — uq oo£l 2 



o 



v 1 + uQq 

The above interpretation of the terms, p l fi(x),i — 1)2,3, implies that Q is 
the average number of repressors per operon, and hence, can have any value 
between and 3. At large inducer concentrations, Q ~ 0, and the error is 
guaranteed to be vanishingly small. At low inducer concentrations, Qq can ex- 
ceed 1, provided the fraction of binary and ternary operons is sufficiently large. 
However, we show below that in wild-type cells, Qo is close to 1 (Fig. [T3h). 
In repressor-overexpressed cells, f2 can approach 3, but lj is so small that 
the relative error of u does not exceed 20% (Fig. IB.l]) . The zeroth-order so- 



lution is therefore a good approximation at all repressor levels and inducer 
concentrations. 

Substituting pB in (1521) and §F7§ with d = yields 

i/q = - , (34) 

1 -)- "1 _|_ "1 _|_ a 2 _|_ «2 _|_ Q3 



T = v Q 



"1 I "1 I "2 I "2 I "3 

«2 



1 + 



(35) 



which shows that in the presence of DNA looping, the induction rate is for- 
mally different from (|28|) . It turns out, however, that in wild-type lac, the 
parameter values are such that several terms in the above expresssions are 
negligibly small. To see this, it is useful to define 

Hx)= t: ' 2i , »= 1,2,3, (36) 
Hx)= - . : 2(m) , i = l,2, (37) 



(i + x) 



and rewrite fl34l) as 



i/ = * * • (38) 

i + Mx) + Mx) + Mx) + Mx) + Mx) ' 
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Figure 9. Estimati on of k 2 , K\ 2 K^, and by fitting the repression data from 
Oehler et all . Il990l . Table I and lOehler et all Il994l . Figs 4-5 to eqs. (1401) and (j42l). 
The values of TZ^i and IZ312 at 90-fold overexpression are lower bounds for the 
repression. The true repression levels are too high to be measured accurately. 



Evidently, 4>i{x) an d 4>i(x) are the relative concentrations of the non-looped 
and looped operons containing i repressors (measured relative to the con- 
centration of free operons). In particular, the parameters, «j = 0«(O) and 
Si = 0j(O), are the relative concentrations of these operons in the absence of 
the inducer. 



We begin by determining the wild-type values of «j and 5j. The above esti- 
mates of Ki, k 2 , and K3 imply that ai = 31, a 2 = 19, and = 3. To find the 
remaining parameters, Si, 0:2, observe that since 



K 



312 



T(0) 



1 + «i + Si + a 2 + a 2 + a 3 
1 + k 2 ' 



(39) 



the repression in cells containing pairs of operators are given by the expres- 
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siono 

^i 2 = ^ — — , 40 

1 + k 2 

1 + K 2 + K 3 + K 2 K 23 + K,2K3 ( A ~\ \ 

7132 ~ 1+^ ' (41) 

7^31 = 1 + Ki + K 3 + + Kl«3- (42) 

Fitting the repression data obtained at various overexpression levels to these 
equations yields the estimates, k 2 = 0.38, K\ 2 = 32, Ki 3 = 15, K23 = 2.5 
(Fig. [9]), which imply that 

Si = k x K 12 + K t K 13 + k 2 K 23 = 1420. 

Since the measured value of 7^312 is 1300, eq. ( 1391) implies that a 2 = 322. 

These parameter values imply that in wild-type cells, the induction rate is 
much simpler than fl3~5"|) . To see this, observe that in the absence of the inducer, 
the relative concentrations of binary and ternary operons are small compared 
to the relative concentrations of free and unary operons, i.e., 

a 2 + a 2 + a 3 < 1 + a x + Si- (43) 

Now, eqs. (I36H37I) imply that in the presence of the inducer, the relative con- 
centrations of the binary and ternary operons decrease with the inducer con- 
centration at a rate as fast, or even faster, than the corresponding rate for the 
looped unary operons. It follows that even in the presence of the inducer, the 
relative concentrations of the binary and ternary operons remain negligibly 
small compared to the relative concentrations of the unary and free operons, 
i.e., the relation 

Mx) + Mx) + Mx) < 1 + <f>(x) + Mx) 

is true for all x > 0. The fraction of free operons in wild-type lac is therefore 
well-approximated by the simpler expression 

u w ^ . (44) 

i + <Mx) + <Mx) 

A similar argument shows that in the absence of the inducer, k 2 /(1 + x) 2 , 
the relative concentration of 02-bound operons, is 0.38, and ( |35i ) implies 
that almost 1/3 of the transcription occurs from 2 -bound operons. How- 
ever, k 2 /(1 + x) 2 decreases so rapidly with the inducer concentration that it 
is already below 0.2 at % = 0.5. Thus, the transcription rate of wild-type lac 



Eq. (T40l) is the kinetic analog of eq. (T4| derived from thermodynamic principles. 
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is well-approximated by the expression 



T( X ) « MX) = — 77— ]^ /n 7 ,4 (45) 
l + ai/(l+x) +ai/(l+x) 

for all but a negligibly small range of inducer concentrations. This expression 
is simpler than (1351) . but formally different from (l28l) . The physical reason for 
this will be discussed shortly. 

The parameter values also imply that in the absence of the inducer, the relative 
concentrations of the free and non-looped unary operons are negligibly small 
compared to relative concentration of looped unary operons, i.e., 



1 + «i <C Si. 



It follows that in wild-type cells, the repression is exerted almost entirely by 
the looped unary operons, i.e., 

K U2 M M (46) 

1 + ^2 1 + ^2 

This equation explains an important trend in Table ED Specifically, the ad- 
dition of only one of the auxiliary operators to the main operator increases 
the repression dramatically (25- to 40-fold) because K u ,Ki 3 ^> 1. However, 
addition of the second auxiliary operator provokes no more than a 2- or 3-fold 
increase because the magnitudes of Kyi and Ki 3 are comparable. 



Comparison of ( 1281) and (1451) shows that the induction kinetics are qualitatively 
different in the presence of DNA looping precisely because <f>i(x) decreases 
faster than (f>i(x)- The physical reason for this is as follows. Looped unary 
states can form only if free repressor binds to an operator, whereas non-looped 
unary states can form if free or inducer-bound repressor binds to an operator. 
More precisely, eqs. ( ITOl) and (fUj) show that the relative concentrations of 
looped and non-looped unary operons are proportional to r and r + r'/2 = 
r(l + x) 2 , respectively. Since r is proportional to (1 + x)~ 4 5 0i(x) an d 4>i{x) 
decrease at the rates (1 + x)~ 4 an d (1 + x) ~ 2 ■, respectively. 

Analysis of the data confirms that DNA looping produces a qualitative change 
in the kinetics, which cannot be captured by quantitative adjustment of the 
parameters in eq. (1251) . If the data were consistent with ( |25l ). the [T/ (1 — T)] 1 ^ 2 
vs. x plots would be straight lines. However, construction of these plots for 
three different strains of E. coli yields not straight lines, but curves with 
conspicuously small slopes at low inducer concentrations (Fig. ITOh). 

The reason for the nonlinearity of the [T/ (1 — T)] 1 / 2 vs. x plot becomes evident 
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Figure 10 Analysi s of th e data for strains BB20 lac% (♦), 2001c (*), and 15TAU 
lac^ (■) (lOverathl . fl968l . Fig. 1). (a) The [T/(l - T)] 1/2 vs. x plots are not straight 
lines. The slopes decrease significantly at low inducer concentrations, (b) The 
[T/(l — T)] 1 / 4 vs. x plots are linear at low inducer concentrations. The black, red, 
and green lines are fits obtained from the data for IPTG concentrations below 20 fiM. 

Table 2 

Parameter values of eq. (145]) estimated from the induction curves for 6 different 
strains of E. col 



Strain 


K- 1 GuM) 


Si 


a\ 


Reference 


BB20 lac^ 


16.3 


1834 


62 


Overath. 1968. Fier. 1 


2001c 


26.2 


741 


12 


Overath. 1968. Fie. 1 


15 TAU lac^ 


44.2 


89 





Overath. 1968. Fie. 1 


600Co c yf 


17.5 


13 





Overath. 1968. Fie. 1 


W3102i* 


3.0 


66 


7 


Gilbert and Miiller-Hill. Fie. 1 


BMH8117 AEwtl23 


10.9 


4921 


219 


Oehler et a,l.. 2006. Fie'. 1A 



if eq. ( 1451) is rewritten as 



1 _ i - 



+ 



«1 



\2 ■ 



(i + xT (i + x) 

Since Si ~ 50«i in wild-type lac, the first term, which accounts for the repres- 
sion due to looped unary operons, dominates at sufficiently low inducer con- 
centrations, x ^ \JS\/ai — 1 pa 6. At these low concentrations, [T/ (1 — T)] 1 / 4 
vs. x plots should be straight lines because 



T 



1 — T 



i/i 



a 



1/4 



+ 



1/4 



X. 



The experimental data for 3 different strains of E. coli shows that this is 
indeed the case (Fig. [T0b). To be sure, the [T/(l — T)] 1 / 2 vs. x plots are also 
straight lines at sufficiently large inducer concentrations (Fig. [T0k). This is 
because when x ^ ySiToi — 1, the non-looped unary states dominate, so 
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Figure 11. Fits of the data from Qverath . 19681 and Gilbert and Muller-Hilll . 19661 
for: (a) BB20 iocg (♦), 2001c (*), and 15TAU lac^ (■). (b) Operator-constitutive 
strain 600Co c yf (♦), and tight-binding strain W3102 (if). The data was fitted with 
eq. (I45j) and the parameter values in Table El 



that 



T 



1 — T 



1/2 



a 



1/2 



+ 



1/2 



X. 



However, neither plot can be linear over the entire range of inducer concen- 
trations. 



Eq. (1451) provides good fits to the experimental data (Figs. [5b and [TTTl. The 
parameter values for these fits, shown in Table [2], were estimated as follows. If 
sufficient data was available at low inducer concentrations (Fig.[TT|), aq and K x 
were estimated from the slopes and intercepts of the [T/(l — T)] 1 / 4 vs. x plots. 
The value of aq was then determined by one-parameter nonlinear regression of 
the data (MATLAB, LSQNONLIN). If accurate data was not available at low 
concentrations (Fig.[5h), all three parameter values were obtained by nonlinear 
regression of the data. 

In wild-type cells, the binary and ternary operons were neglected by ap- 
pealing to (I36l)-(l37l) and ( |43l h The latter relation is not valid for repressor- 
overexpressed cells. This is because azj,cij are proportional to (r t y . Hence, as 
the repressor level increases, 02,02,03 increase much faster than aq,3q, and 
at sufficiently large repressor levels, 

cv 3 > a 2 , a 2 > aq, Si > 1, (47) 

i.e., almost all the operons are in the ternary state. Fig. [T2a shows that in the 
absence of the inducer, Q f& 1 in wild-type cells, but increases to ~3 in cells 
containing ~500 times the wild-type repressor levels. In vitro data provides 
direct evidence of this increase in Qq. When DNA fragments, containing two 
appropriately spaced lac operators, are exposed to increasing repressor lev- 
els, there is a perceptible increase in the concentration of binary non-looped 
complexes (Fig.[T2b). In vivo data also suggests that fl increases in repressor- 
overexpressed cells. Oehler et al found similar repression levels in two different 
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Figure 12. The number of repressors per operon increases with the fold-increase in 
repressor level relative to the wild-type repressor level, (a) The model prediction 
calculated from (|33j) assuming x = and aj, Sj have wild-type values, (b) When 
DNA fragments with two lac operators are exposed to increasing repressor levels 
(lanes b-e), the concentra tion of binary non-looped fragments increases progres- 



sively (|Oehler et all Il990l . Fig. 4). The symbols on the left show the structures 
of the fragments (unary looped at the top, followed by binary non-looped, unary 
non-looped, and free fragments). 

strains of E. coli containing high levels (900 molecules per cell) of the wild- type 



tetrameric and mutant dimeric repressor, respectively ( lOehler et all 119901 . Ta- 
ble I). They argued that this is because at such high repressor levels, most 
of the operons are in the ternary state. Since ternary operons cannot form 
loops even in cells containing the tetrameric repressor, the repression levels 
are similar in both cell types. More precisely, ((47j) and (IA.3I) imply that 

1 



^312 


dimer 


7^-312 


tetramer 



(1 + aj2 + a 2 /A + q 3 /8) / (1 + k 2 /2) _ 
(1 + «i + Si + a 2 + a 2 + a 3 ) /(l + k 2 ) 4 



The experimentally observed value of this ratio is higher (0.5) possibly be- 
cause at such high tetrameric repressor levels, the repression is too high to be 
measur ed accurately. The m easured value of the repression is, at best, a lower 



bound (lOehler et allll994 Fig. 5). 



It is therefore clear that in repressor-overexpressed cells, binary and ternary 
operons are dominant in the absence of the inducer. We expect that they will 
remain dominant at sufficiently small inducer concentrations. This becomes 
evident if we plot the fractions of various states of the operon as a function 
of the inducer concentration. The fractions of non-looped and looped operons 
containing i repressors are given by 

Oiix) = — > (48) 

i + Mx) + Mx) +^Mx) + Mx) + Mx) ' ' 

Q.(y) = ti^l (49) 

lK i + Mx) + Mx) + Mx) + Mx) + Mx) ' 

The fraction of free operons, which is precisely u, is given by ( |38l) . In wild-type 
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Figure 13. Distribution of the fractions of various states as a function of the inducer 
concentration: (a) Wild-type cells. (b,c) Repressor-overpressed cells with 90-fold 
overexpression. In (a, b), the black curve represents the fraction of binary and 
ternary operons; the red, green, and blue curves represent the fractions of looped 
unary, non-looped unary, and free operons, respectively. In (c), the full, dashed, and 
long-dashed lines denote the fractions of ternary, looped binary, and non-looped 
binary operons, respectively. 

cells, the fraction of binary and ternary operons, (9 2 + 2 + 6> 3 ), is small at all 
inducer concentrations (FiglT3~a. black curve). In repressor-overexpressed cells 
with 90-fold overexpression, this fraction is dominant for all x < 5 (Fig [13b, 
black curve). If we plot the individual components, #2, #2, #3, of this fraction, 
it becomes clear that the ternary and binary looped operons are dominant 
f° r X ^ 3 (Fig lT3"b). It follows that the kinetics of repressor-overexpressed 
cells cannot be captured by eq. (1451) — it is necessary to use the more general 
expression (1351) . 

We tested the validity of the model by determining the extent to which it 
could fit the induction curves for cells containing wild-type repressor levels 
(Fig. [TT]). The fits do not prove the validity of the model because these in- 
duction curves show the variation of only one of the model variables — the 
fraction of free operons — as a function of the inducer concentration, . If the 
model is truly valid, the fraction of every looped and non-looped species will 
vary in a manner consistent with the model. It is therefore particularly useful 
that these fractions follow simple scaling relations, which are experimentally 
testable because each fraction migrates at a different speed in polyacrylamide 
gel electrophoresis (Fig. fT2b). To see this, note that there are three distinct 
trends in Figs.fT3b.c: (a) The fraction of free operons increases monotonically, 
(b) the fractions of ternary and looped binary operons decrease monotoni- 
cally, and (c) the fractions of the remaining three states of the operon pass 
through a maximum. These trends follow immediately from the definitions 
(l48l)-(l49l). They are similar to the concentration profiles observed in series re- 
actions (A — > B — > • ■ ■ ), wherein as time progresses, the concentration of the 
first (resp., last) component decreases (resp., increases) monotonically, and 
the concentrations of the intermediate components pass through a maximum. 
In Figs. [TSb.c. the inducer concentration plays a role analogous to time: As x 
increases, the ternary operons are successively converted to binary, unary, and 
free operons. But there is an important difference. Since 02 and 03 decrease 
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corresponds to E. coli BMH8117 AEwtlOO (♦), which contains only the main oper- 
ator, and E. coli 15TAU lac 2 (*), W3102 (■) which contain all three operators. 



with x a t the same rate, the model predicts that the ratio, 9 2 /9 3 , has the same 
value, 02/0(3, at all inducer concentrations. Similarly, the ratio, 6\/6 2 , must 
have the same value, ai/a 2 , at all inducer concentrations. These scaling rela- 
tions were obtained by varying the inducer concentrations at fixed repressor 
levels. If the repressor levels are changed at fixed inducer levels, say, x = 
(Fig.[T2b). the model predicts that Qi/0i will have the same value, oti/ai, at all 
repressor levels. Experimental tests of these scaling relations provide a strin- 
gent test of the model. Furthermore, deviations from these scaling relations 
may reveal the untenable assumptions of the model. 



4 Discussion 



Given the above results, we can state the conditions under which the kinet- 
ics of lac induction can be described by eqs. (DO) and (j2j) of the Yagil & Yagil 
model. If DNA looping is weak or absent, both equations provide good approx- 
imations to the kinetics, but (pp) is valid at all inducer concentrations, whereas 
(J2j) captures the kinetics only at sufficiently large inducer concentrations. In- 
deed, the latter equation predicts that the slope of the induction curve is zero 
at small inducer concentrations. This is inconsistent with the data — the in- 
duction curve increases linearly at inducer concentrations as low as ~0.5 [iM, 
regardless of the presence or absence of DNA looping (Fig. [141) . 

In the presence of DNA looping, the kinetics of wild-type cells are more co- 
operative than the kinetics predicted by the Yagil & Yagil model, and this 
cooperativity becomes even more pronounced in repressor-overexpressed cells. 
This result has important implications for the dynamics of the lac operon. As 
we show below, it suggests that repressor overexpression can be used to induce 
bistability in systems that are otherwise bistable. 
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(a) (b) 

Figure 15. Induction of bistability by repressor overexpression. (a) In wild-type E. 
coli, there is bistability during growth on succinate + TMG (black curves). If the 
repressor levels are reduced, the induction rate becomes hyperbolic (red curve), and 
bistability disappears, (b) In wild-type E. coli, there is no bistability during growth 
on lactose (black curves). If the repressor is overexpressed, bistability is induced 
because the induction rate becomes more cooperative (red curve). 

Molecular biologists have know n for a long tim e that cooperativity plays a 



central role in genetic switches ( jPtashnd . 1 19921 . p. 28). This was conclusively 



demonstrated by recent experiments with the lac operon. Ozbudak et al in- 
serted into the chromosome of E. coli MG 1655 a single copy of a lac reporter 
gene coding for green fluorescence protein. In these cells, the green fluores- 
cence intensity provides a measure of the instantaneous activity of the lac 
enzymes. They showed that when these cells were grown exponentially on a 
medium containing succinate and the gratuitous inducer, TMG, the enzyme 
activities displayed bistability. Futhermore, this bistability could be captured 
by the steady states of the equation 

de 1 + Klx 2 s 
— r„e, x oc e- 



alt a x + 1 + Klx 2 9 ' K s + s 

where e and s denote the lac permease activity and extracellular TMG concen- 
tration, respectively; r g denotes the specific growth rate on succinate; and the 
inducer concentration, x, is assumed to be proportional to the TMG uptake 
rateJZI Bistability occurs precisely because the induction rate, which increases 
as e 2 , is more cooperative than the dilution rate, which is proportional to e 
(Fig.[T5a. black curves). Indeed, if the repressor level is decreased by "titrating" 
the repressor with the lac operator, the induction curve loses its cooperativity 
— it becomes hyperbolic (Fig. [i~5k. red curve), and the bistability disappears. 

The above example shows that bistability can be abolished by decreasing the 



5 The repression of the lac reporter gene used in this study was only 170. This is 
partly because the reporter gene lacks 02- However, the 0\,Oz interaction is also 
somewhat attenuated because 0%, 03-containing cells yield a repression of 440 (Ta- 
bled]). Given the weak DNA looping, it is conceivable that eq. J2]) approximates the 
induction kinetics. 
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repressor level, and hence, the cooperativity of the induction curve. It is there- 
fore conceivable that bistability can be imposed upon monostable systems by 
increasing the repressor level. Ozbudak et al observed that their system ex- 
hibited n o bistability if the cell s were grown on lactose, rather than succinate 



TMG (jOzbudak et al.. 120041). One hypothes i s for explaining the absence of 



bistability is as follows (INarang and Pilyuginl . 120061 ). During growth on suc- 
cinate + TMG, the specific growth rate is independent of the lac permease 
activity. In sharp contrast, during growth on lactose, the specific growth rate 
is proportional to the specific lactose uptake rate, i.e., r g oc es/(K s + s), where 
s now represents the concentration of extracellular lactose. The dilution rate 
is therefore as cooperative as the induction rate (both rates increase as e 2 ), 
and bistability is impossible (Fig. [T5b. black curves). In such systems, bista- 
bility can be induced by overexpressing the repressor because the induction 
rate then increases as e 4 or e 6 , which is significantly more cooperative than 
the dilution rate (Fig. [T5b . red curve). Thus, the increase in cooperativity 
generated by high repressor levels can be exploited to impose bistability upon 
systems that otherwise show little propensity for switch-like behavior. This 
may be useful in synthetic biology, which is concerned, among other things, 
with the development of genetic switches. 



5 Conclusions 



We formulated a model for the kinetics of lac induction which takes due ac- 
count of the tetrameric structure of the repressor, the existence of the auxiliary 
operators, and the attendant DNA looping. Analysis of the model shows that: 

(1) In the absence of DNA looping, the kinetics are given by eq. (l25l) . which 
is formally similar to the Yagil & Yagil model. In the presence of DNA 
looping, the kinetics are significantly more cooperative. 

(2) In wild-type cells, no more than one repressor binds to an operon, and the 
kinetics are given by eq. (1441) . which depends on powers of x as high as x A . 
The cooperativity increases markedly because the concentration of looped 
repressor-operator complexes decreases with the inducer concentration at 
a rate much faster than the corresponding rate for non-looped complexes. 

(3) If the repressor is overexpressed in wild-type cells, multiple repressors 
are bound to most of the operons, and the kinetics are given by eq. (l32l) . 
which depends on powers of x up to x 6 . The cooperativity is enhanced 
even further because multi-repressor operons are more sensitive to the 
inducer concentrations than operons with only one repressor. 

(4) The model provides good fits to the induction curves for 4 different strains 
of E. coli. We also show that if the model is correct, the relative concen- 
trations of certain looped and non-looped species must remain the same 
at all inducer (or repressor) concentrations. These scaling relations, which 
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lie at the heart of the model, can be rigorously tested by gel electrophore- 
sis. 



These results should be useful in analyzing kinetic data for induction of oper- 
ons involving DNA looping, and in formulating dynamic models for induction 
of such operons. 
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A Induction kinetics and repression in cells containing mutant 



Equations (TT8t ) (TT9T) were derived for cells containing the tetrameric repressor. 
If the cells contain mutant dimers that can bind to the operator but do not 
tetramerize, the corresponding equations are 



where p now denotes the fraction of free mutant dimers. These equations 
differ from eqs. (fT8l)- (fT9l) in three ways: (a) The parameters, ati, satisfy the 
relations, a% = «i/2, a 2 = 0.2/^, &3 = 013/8, since the association constants 
for dimer-operator binding are half of the corresponding association constants 
for tetramer-operator binding, (b) The first term of eq. (IA.ll) depends on 
(1 +x) 2 , rather than (1 + x) 4 , because mutant dimers have only two inducer- 
binding sites, (c) The terms in square brackets do not depend on the inducer 
concentrations because inducer-bound mutant dimers cannot bind to the op- 
erator. The latter also implies that the transcription rate is proportional to 
T = v (1 + R 2 p) , k 2 = K2/2, provided d = 0. 

The zeroth-order solution is 



dimers 



p(l + X) 2 + uv pa 1 + 2p 2 a 2 + 3p 3 a 3 = 1, 
v 1 + pa\ + p 2 a 2 + p 3 a 3 — 1. 



(A.l) 
(A.2) 



1 



(i + x) 2 ' 



1 



Vq = 



1 + po"! + Po«2 + Po"3 ' 
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(b) 



(c) 
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Figure A.l . The model predic ts the repres sion in cells contain ing mutant dimers 
(data from lOehler et allll99f)l . Table I, and lOehler et all Il994l . Figs 4-5). The full 
lines show the model predictions, calculated from eqs. (IA.3I) - (|A.6|) with the wild-type 
parameter values, K\ = 30, k 2 = 0.38, k 3 = 0.24, determined in Section [3l 



which implies that 



T 



i + R 2 /(i + x y 



1 + ax/ (1 + X) + <V (1 + x) + «s/ (1 + x) 



Although these kinetics can be highly cooperative, the parameter values for 
cells containing wild-type repressor levels are such that the corresponding 
kinetics are formally similar to eq. (1251 ). Fig. Eb shows that this equation 
provides a good fit to the induction curve of cells containing mutant dimers. 

To see this, observe that the values of a\, a 2 , «3 for cells containing wild-type 
levels of tetrameric repressor imply that a,\ = 15.5, 02 = 5, (S3 = 0.25. Since 
0:2, «3 are small compared to «i, 



«2 



«3 



(i + x ) 4 '(i + x )° "(i+xT 



a 1 



for all but a negligibly small range of inducer concentrations. The induction 
kinetics are therefore formally identical to eq. (1251) . 

The repression in cells containing all three operators is 



K 
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1 + a 1 + a 2 + a 3 
1 + k 2 



(A.3) 
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which implies that 



Tlx = 1 + ki, Tl 2 = 1, (A.4) 
^3 = 1 + k 3 , ^12 = — — , (A.5) 

l + /«2 

AC32 = : Z , ^-31 = 1 + + K 3 + (A. 6) 

1+^2 

where /2. Fig. 1AJJ shows the repression predicted by these expressions, 

assuming that Ki, k 2 , k 3 have the values estimated in Section [3] from the data 
for cells containing the tetrameric repressor. The good agreement with the 
repression data for cells containing mutant dimers suggests that the model 
and the parameter values are plausible. 



B Solution of eqs. (1181)— ( JIUT) by regular perturbation 



We wish to solve the equations 



p(l + x f + uju(pf 1 + 2p 2 f 2 + 3p 3 f 3 
v{l+ph + p 2 f 2 + p'h 



(B.l) 
(B.2) 



for small u. To this end, assume that the solutions have the form 

p = Po + uo Pi + 0(uj 2 ), (B.3) 
v = uo + upx + 0(uj 2 ). (B.4) 

Substituting these solutions in (1B.1|) - (1B.2I) . and collecting terms with like 
powers of u yields 



p x (1 + X f + ^ (po/i + 2p 2 J 2 + 3plf, 



+ . . 



0. 



^o(1 + Po/i + Po/2+Pq/j 



VoPi (fi + 2p f 2 + 3po/ 3 ) + 



It follows that 



+ M 1 + po/i + p5/2 + pS/ 3 +--- = o. 



pi 



:i+x) 



1 • 



1 + P0/1 + P0/2 + P0/3' 



P0/1 + 2p 2 f 2 + 3p 3 f 3 ) 



-VqPi 



fi + 2p / 2 + 3pg/ 3 
1 + P0/1 +P0/2 + P0/3' 



33 



0.5 




10 100 1000 

Overexpression factor 



Figure B.l. The relative error for isq does not exceed ~20%. The relative error was 
calculated assuming x = 0, u = 0.2, and a, n dti have wild-type values. 



If we define 



n n = 



A)A + 2p§/2 + 3p§/ 3 
1 + Pofi + plh + Poh 



(B.5) 



Pi and V\ can be written as 

pi = -po^o, v\ = ^o^o- 
Substituting these expressions in flB.3|) - flB.4l) yields 



p = p (l-uQ ) + O(u 2 ), 
is = v (l +O(oo 2 ). 



(B.6) 
(B.7) 



These are the first-order solutions for the general model. 



The parameter Q approximates the average number of repressors bound to 
an operon because (IB.5I ) can be rewritten as 



where 



8i,t = 



Pofi 



1 + Pofi + Pofi + Pofs ' 



z = 1,2,3. 



is the fraction of operons containing i repressors. It follows that Q must lie 
between and 3. In the absence of the inducer, fl increases with repressor 
overexpression from ~1 to 3 (Fig. [12]). However, the relative error for u does 
not exceed ~20% (Fig.EjJ. 

In the absence of the auxiliary operators, / 2 = f 3 = 0. In this case 

1 - Pofi 



vo 



1 + Pofi 



1 + Pofi 



Q = 1 - i/ - 
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Substituting this relation in (1B.6I) - (1B.7I) yields 



p = p [1-u(1-u )}+O(uj 2 ), 
v — Vq 1 + u;(1 — z/ ) 2 + 0{u 2 
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